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Abstract 

We consider a variant of the well-studied gossip-based model of communication for disseminating 
information in a network, usually represented by a graph. Classically, in each time unit, every node u 
is allowed to contact a single random neighbor v. If u knows the data (rumor) to be disseminated, node 
v learns it (known as push) and if node v knows the rumor, u learns it (known as pull). While in the 
classic gossip model, each node is only allowed to contact a single neighbor in each time unit, each node 
can possibly be contacted by many neighboring nodes. If, for example, several nodes pull from the same 
common neighbor v, v manages to inform all these nodes in a single time unit. 

In the present paper, we consider a restricted model where at each node only one incoming request 
can be served in one time unit. As long as only a single piece of information needs to be disseminated, 
this does not make a difference for push requests. It however has a significant effect on pull requests. If 
several nodes try to pull the information from the same common neighbor, only one of the requests can 
be served. In the paper, we therefore concentrate on this weaker pull version, which we call restricted 
pull. 

We distinguish two versions of the restricted pull protocol depending on whether the request to be 
served among a set of pull requests at a given node is chosen adversarially or uniformly at random. 
As a first result, we prove an exponential separation between the two variants. We show that there 
are instances where if an adversary picks the request to be served, the restricted pull protocol requires 
a polynomial number of rounds whereas if the winning request is chosen uniformly at random, the 
restricted pull protocol only requires a polylogarithmic number of rounds to inform the whole network. 
Further, as the main technical contribution, we show that if the request to be served is chosen randomly, 
the slowdown of using restricted pull versus using the classic pull protocol can w.h.p. be upper bounded 
by 0(A/<5 • logn), where A and S are the largest and smallest degree of the network. 



1 Introduction 


Gossip-based communication models have received a lot of attention as a simple, fault-tolerant, and in 
particular also scalable way to communicate and disseminate information in large networks. The classic 
application of gossip-based network protocols is the spreading of information in the network, specifically 
the problem of broadcasting a single piece of information to all nodes of a network, in this context also often 
known as rumor spreading, e.g., [5, 6,8,10,11 , 16]. On top of this, gossip-based protocols have for example 
also been proposed for applications such as maintaining consistency in a distributed database [6], for data 
aggregation problems [ , 17, 18], or even to run arbitrary distributed computations [2], 

The best studied gossip strategy is the random phone call model, which was first considered in [10]. 
We are given a network graph G = ( V. E) where initially a source node s € V knows some piece of 
information (rumor) and the objective is to disseminate the rumor to all nodes of G. Typically, time is 
divided into synchronized rounds, where in each round, every node can contact a random neighbor and if 
u contacts v, an interaction between u and v is initiated for the current round. For spreading a rumor, two 
basic modes of operation are distinguished. Nodes that already know the rumor can PUSFI the information 
to the randomly chosen neighbor [10] or nodes that do not yet know the rumor can PULL the information 
from the randomly chosen neighbor [6]. In much of the classic work, the network G is assumed to be a 
complete graph. In that case, it is not hard to see that PUSH and PULL both succeed in O(logn) rounds 
and that the total number of interactions of each node can also be bounded by O(logn). In [16], it is shown 
that when combining PUSH and PULL (in the following referred to as PUSH-PULL), the average number 
of interactions per node is only © (log log n). 

Mostly in recent years, PUSH, PULL, and PUSH-PULL have also been studied for more general 
network topologies, e.g., [3, 5,8,9,1 1-13], with [5,1 1] and[12,13] studying the time complexity as a function 
of the graph’s conductance and vertex expansion, respectively. E.g., in [ 1], it is shown that with high 
probability (w.h.p.), the running time of PUSH-PULL can always be upper bounded by O ((log n)/0(G)), 
where n is the number of nodes and 0(G) is the conductance of the network graph G. 

While in gossip protocols, each node can initiate at most one interaction with some neighbor, even if 
each node contacts a uniformly random neighbor, the number of interactions a node needs to participate in 
each round can be quite large. In complete graphs and more generally in regular graphs, the total number of 
interactions per node and round can easily be upper bounded by O(logn). However in general topologies 
a single node might be contacted by up to @(n) neighboring nodes. As an extreme case, consider a star 
network where a single center node is connected to n — 1 leaf nodes. Even if the rumor initially starts at a 
leaf node, PUSH-PULL manages to disseminate the rumor to all nodes in only 2 rounds. Clearly, in these 
2 rounds, the center node has to interact with all n — 1 leaf nodes. In fact, all recent papers which study the 
time complexity of the random PUSH-PULL protocol critically rely on the fact that a node can be contacted 
by many nodes in a single round, e.g., [11]. In some cases, this behavior might limit the implementability 
and thus the applicability of the proven results for this gossip protocol. In order to obtain scalable systems, 
ideally, we would like to not only limit the number of interactions each node initiates, but also the number 
of interactions each node participates in. 

In the present paper, we therefore study a weaker variant of the described random gossip algorithms. 
In each round, every node can still initiate a connection to one uniformly random neighbor. However, if 
a single node receives several connection requests, only one of these connections is actually established. 
When disseminating a rumor by using the PUSH protocol, this restriction does not limit the progress of the 
algorithm. In a given round, a node v learns the rumor if and only if at least one PUSH request arrives at 
v. However, when using the PULL protocol, the restriction can have a drastic effect. If a node v receives 
several PULL requests from several nodes that still need to learn the rumor, only one of these nodes can 
actually learn the rumor in the current round. In our paper, we therefore concentrate on the PULL protocol 
and we define RPULL (restricted PULL) as the described weak variant of the PULL algorithm: In each 
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RPULL round, every node that still needs to learn the rumor contacts a random neighbor. At every node 
that knows the rumor, one of the incoming requests (if there are any) is selected and the rumor is sent to the 
corresponding neighbor. By PUSH-RPULL we denote the combination of RPULL with a simultaneous 
execution of the classic PUSH protocol. 

Contributions We first consider two versions of the RPULL protocol which differ in the way how one 
of the incoming requests is selected. Assume that in a given round some informed node v receives RPULL 
requests from a set of neighbors R v . In the adversarial RPULL protocol, an (adaptive) adversary picks 
some node u E II, } which will then learn the rumor. In the random RPULL protocol, we assume that a 
uniformly random node u E R v learns the rumor (chosen independently for different nodes and rounds). 
While the choice of which neighbor a node (actively) contacts with a request is under the control of the 
protocol, it is not necessarily clear how one of the incoming requests in R v is chosen. If the node can 
only answer one request per time unit and the requests do not arrive at exactly the same time, the first 
request might be served and all others dropped. Or even if requests arrive at the same time, it might be the 
underlying network infrastructure or operating system which picks one request and drops the others. If it is 
reasonable to assume that the incoming requests are served probabilistically and independently, we believe 
that random RPULL provides a good model. Otherwise, the adversarial assumption allows to study the 
worst-case behavior. 

As a first result, we show that there are instances for which there is an exponential gap between the 
running times of the two RPULL variants. We give an instance where for every source node the random 
RPULL protocol informs all nodes of the network in polylogarithmic time, w.h.p., whereas, for every 
source, the adversarial RPULL algorithm requires time O(y'n) to even succeed with a constant probability. 

In the second part of the paper, we have a closer look at the performance of the random RPULL pro¬ 
tocol. Consider a graph G and let 5 and A denote the smallest and largest degree of G. In each round, in 
expectation, each informed node receives at most A/S requests. Hence, if an uninformed node u sends an 
RPULL request to an informed node, u should receive the rumor with probability at least £1(5/A). Conse¬ 
quently, intuitively, the slowdown of using random RPULL instead of the usual PULL protocol should not 
be more than 0(A/5). 1 We prove that this intuition is correct. For every given instance, we show that if 
the PULL algorithm informs all nodes in T rounds with probability p, for the same instance, the random 
RPULL algorithm manages to reach all nodes in time 0(T■ ^ ■ log n) with probability (1 — o(l))p. 2 While 
the statement might seem very intuitive, its formal proof turns out quite involved. Formally, we prove a 
stronger statement and show that a single round of the PULL protocol is w.h.p. stochastically dominated by 
0(y • logn) rounds of random RPULL in the following sense. We give a coupling between the random 
processes defined by PULL and random RPULL such that for every start configuration, w.h.p., the set of 
nodes informed after 0(j L ■ log n) rounds of random RPULL is a superset of the set of nodes informed in 
a single PULL round. The same holds for simulating one round of PUSH-PULL with PUSH-RPULL. A 
similar coupling between rumor spreading algorithms has been done in [ ] where the authors couple log(n) 
rounds of asynchronous- with one round of synchronous PUSH-PULL. A coupling between PULL and 
RPULL in the classic sense, i.e., a coupling which does relinquish the w.h.p. term does not exist. We also 
show that for such a round-by-round analysis, our bound is tight. That is, there are configurations where 
H (4 log n) random RPULL rounds are needed to dominate a single PULL round with high probability. 

Notation and Preliminaries Let G = (V, E) be the n-node network graph. For a node u E U, we use 
N ( u ) to denote the set of neighbors of u and d u = d(u) := | N(u) | to denote its degree. Given a set of nodes 
S C V, we define Ns(u ) := N(u) Cl S to be the set of ids neighbors in S and ds(u ) := | ATgr(t/)| for the 

'Here O hides log(n) factors. 

“Actually, y can be replaced by max{ Ui „} 6B deg(u)/ deg(u) in all parts of the paper. 
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number of neighbors of u in S. The smallest and largest degrees of G arc denoted by 5 and A, respectively. 
For a set V' C V we denote with G\V'] the graph induced by V'. To indicate a disjoint union of two sets, 
i.e., Au B with A n B = 0, we write AdB. For a set of natural numbers {1,..., k} we only write [&]. 

When analyzing the progress of an algorithm, we typically use S to be the set of initially informed nodes 
and U to be the initially uninformed nodes. Given some algorithm ALG, the set .S' / AL(i denotes the set of 
informed nodes after t rounds of ALG when starting with the set S ( ALG := S of informed nodes. 

2 Separation of Adversarial and Random RPULL 

We want to show that the adversarial RPULL can be exponentially slower than the randomized RPULL on 
general graphs. To show this, we first establish results on the run time of both algorithms on trees. These 
results might also be of independent interest. 

In a tree network let p V)U = (v = vo,v\, V 2 ..... v q = u) denote the unique path from v to u, though 
we use that notation also for the set of nodes on that path, i.e., p v>u = {v,v\,V 2 , ■ ■ ■ ,v q -i , it}. Define 
D p := Ywep d w , he., the sum of all degrees on the path p. 

The next lemma shows that on a tree any form of RPULL is asymptotically as fast as PULL plus an 
additive term in the order of the degree of the node that initially has the rumor. 

Lemma 2.1. Let G be a tree network with ,S’o = {r} and let u be a node in Uq. Furthermore, let r be the 
first round in which u £ S T holds, i.e., the number of rounds until u gets informed. 

(1) E[r] = Q(D Pr u - d r ) for PULL, 

(2) E[t] = Ll(D Pr u — d r ) for every type o/RPULL, 

(3) E[t] = 0(D Pr u ) for adversarial RPULL. 

Proof of Lemma 2.1. (1) We root the tree at the only informed node, r. Note that nodes are not aware of 
their own parent/child relationships. Consider some time t at which a node r' on the path p rjl is in St \ St- 1 , 
i.e., it just got informed. Thus its child u! £ p r . u on the path is not yet informed, i.e., v! £ U L . In any round 
tl > t , in which v! is not informed yet, it requests its parent with probability 1 / dp. Thus each uninformed 
node u! £ p rM \ {r} on the path needs 0( dp) rounds in expectation before it can get informed. Linearity 
of expectation proves the claim for PULL. 

(2) follows from the fact that RPULL is at most as fast as PULL. 

(3) For adversarial RPULL divide all rounds t’ > t in which u' is not yet informed into two types: 

First rounds in which at least one sibling of u ', i.e., the nodes in N(r')\{u'}, requests from r' and secondly 
rounds in which no sibling of u' requests from r'. The first type of rounds is upper bounded by dp because 
every neighbor of r' stops requesting after receiving the rumor. In expectation u! gets the rumor after dp 
rounds of type two; thus in expectation v! is informed within 0(dp + dp) rounds. Applying this recursively 
to all uninformed nodes on the path p T)U , we get the claimed result via lineality of expectation. □ 

Lemma 2.2. Let G be a tree network with So = {?’}. Then in both random and adversarial RPULL it takes 
0( ma x pa thp Dp + Alogn) rounds to fully inform all nodes in V, w.h.p.. 

Proof of Lemma 2.2. The analysis we use does hold for adversarial RPULL. 

First look at a path p = p r .i = (r = vq,vi,V 2 , ■ ■ ■ ,v q = l) from the root to some leaf l and define 
A p = niaXj.cp d v . Let T, be the random variable that indicates the round in which node v, gets informed 
and with To := 0 we can define X, := T t — T:-i for m = 1 ,,q, the time node t;,_| needs to pass the 
information forward to node v\. For simplicity in the following we define u := v t and s := Vj- \. 

Once s gets informed, a round is called free if no node in N(s) \ { u \ requests to s, otherwise it is 
called congested. In a free round, u gets the rumor with probability at least l/d u , i.e., the number of free 
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rounds is upper bounded by a geometric random variable T/g-ee ~ Georn(l/d v ). On the other hand, in a 
congested round, at least one neighbor of s does get informed, so there can not be more than d s such rounds. 

In total we get that P( X t > r) < P( Y.fi-cc + d s > t) for node u = Vi and with T q = J2']= \ 2Q we 
conclude that P (T q > r) < P(£f =1 Y i, free + Ef= t d Vi _ 1 > r). 

Since we arc on a tree, those geometric random variables are all independent, and we can apply the 
Chernoff Lemma B.l. Let A p be the largest degree of all nodes on the path excluding r, i.e., p\ = 1 / A p 
in terms of the notation from Lemma B.L With Y = J2i= 1 Yi.free we have // = D p — d r > A p and we set 
t = cA p log n for some c > 0. 


_ p, _ cA p log n 

P(Y > 3(/i + i)) < e 2A Pe a p <n~ c , 

i.e., w.h.p., v q is informed within 0(D P + A p log n) = 0(max pa ,h p D p + Alogn) rounds. In a tree there 
are at most n root-leaf paths, therefore a union bound over all individual paths concludes the proof. □ 

Lemma 2.2 shows that random RPULL and adversarial RPULL are essentially the same on trees. This 
does not hold for general graphs. 

Lemma 2.3. There is a graph G = (V, E ) of size 0(n) with node r c <G V, deg(r Q ) < 3, such that: 

• For So = {r a }, w.c.p., the run-time of adversarial RPULL is in S2(y Yi). 

• For any non-empty So C V, w.h.p., the run-time of randomized RPULL is in 0(log 2 n). 

Proof of Lemma 2.3. A picture depicting the graph can be found in Appendix A. 

We use the notation w.h.p.(A;) to say that some event holds with high probability with respect to k, i.e., 
with probability at least 1 — 1 /k. 

We first introduce a graph type, with size depending on some parameter k, that we call a /r-lcal'-conncctcd 
tree (A'-LCT). In simple words, a fc-LCT is a binary tree with k leaves, but with those k leaves being 
fully interconnected, i.e., forming a clique. Mathematically more precise, a graph H = (L>UL, EpddEjf 
with \L\ = k and \B\ = k — 1 is a A;-LCT, iff //^ = (L. Ej) is a complete graph over k nodes and 
Hb = ( B'dL. Eb) is a complete binary tree with its leaves being the nodes in L. While H is not strictly 
speaking a tree, we call nodes in L its leaves, L(H) = L the leaf set, B(F[) = B its branch set and the 
root of Hb we call the root of H. Also, every node in H except for its root has a clearly designated parent 
(defined by lip) and each node in B has two clearly designated children (with respect to the root in Hb). 

Claim 2.4. Let G = (V. E) be a graph and H = {Vp. Eh) be a subgraph ofG that is a k-LCT. Further¬ 
more, let any node v 6 Vh have at most one connection outside of Vp, i.e., deg G (n) < deg^(n) + 1. Then, 
w.h.p. [k), for any non-empty set of nodes in H knowing the rumor, randomized RPULL informs all nodes 
in H within 0(log k ) rounds. 

Without loss of generality let there be one node s having the rumor. If s 6 B := B(H), 
i.e., deg G (s) < deg^(s) + 1 < 4, then we can apply Lemma 2.2 to get that all nodes in B 
arc informed within O(logZs) rounds. Let this be the case. All nodes in L := L(H) have 
degree at most k + 1: k — 1 neighbors in L, one "parent-node” in B and at most one neighbor 
in V \ Vh- Each of them requests to its neighboring parent from B with probability at least 
1 /(k + 1), i.e., in each round, with probability at most (l — < 1/2, no node in L learns 

the rumor. By Chernoff, w.h.p.(A), after 0(log k) rounds, at least one node in L knows the 
rumor. If x > 0 nodes in L arc informed, then each uninformed node ii in L requests from one 
of those x nodes (or a node in B) with probability at least x/ik + 1) and with probability at least 
(1 — l/A;) />: 1 > 1/3 node a is the only node requesting from its target. As long as x < A/2, 
with linearity of expectation, each round the expected number of newly informed nodes in L 
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is in fl(x). Once x > k /2 we can use a similar argument to show that, w.c.p., the number of 
uninformed nodes goes down by a constant factor each round. Hence, after 0(log k) rounds in 
expectation, but also w.h.p.(A;), all nodes in L arc informed. 

If initially sGf, then with probability p G {1/4,1/3} its parent node in B requests from s, 
while at the same time with probability at least (1 — 1/A;) fc_1 > 1/3 no other node in L requests 
from s. Hence, after 0(log k) rounds, w.h.p., the parent node gets the rumor from s. The rest 
follows from reduction to the first case. □ 

We construct G = (Vi U V/, E) as follows. We let D a and Dq be two n-LCTs, and we have m Z-LCTs 
that we denote with Hi, D 2 ,. ■ ., D m , where l := y/n and m := Cy/n for some natural number c. We use the 
notation I), for the corresponding A-LCT and its node set interchangeably. Their coiTesponding roots and 
leaf sets are denoted as r a , r^, r\, ri, ■ ■ ■ r m and L a , L £, L 1 , L 2 , ... L m respectively, and with lx, 1 , lx, 2 , ■ ■ ■ 
we enumerate the leaves of leaf set Lx- Let C Q = {ci,... c m } be an arbitrary m-sized subset of D a ’s 
branch set B a - for simplicity and in accordance to Figure 1 think of C a as the layer of nodes in B a that arc 
at depth log m. 

We let V/ = H^ and V\ = {r} U D a U D\ U ... U D m and we add the following edges. 

• Between r and Dr : We add one edge from r to ry 

• Between r and D a : For each j G [mlogn] we add an edge from r to l a j. 

• Between r and D\,.... D rn : For each i G [m] and j G [log n] we add one edge from r to Zy. 

• Between Hi,..., D rn and H^: For each i G [m] we add one edge from n to Zy. 

• Between Hi,..., D rn and C a : For each i G [m] we add one edge from l.,j to c ? . 

Note that the degree deg(r) is 2m log n + 1 and that all the above defined edges add to any node in a LCT 
at most one edge that connects it to a node outside its own LCT. 

The idea of the proof is the following: The graph is built in a way that information propagation from 
V/ to V\ is quick, but not the other way round. In the random RPULL model, wherever the rumor starts, 
it reaches r quickly and from there y manages to get the rumor from r in polylogarithmic time. Then the 
rumor quickly propagates through V/ = 1)^, and from to all LCTs Hi,..., D m and afterwards to D a . 

In the adversarial RPULL model, as long as the rumor does not start in V/. the rumor can quickly spread 
to r, a few of the D,s and D a but not to V/ because we let the adversary always prioritize a request at node 
r from a node in one of the D, s over a request from y to prevent that ry will get the rumor. This is possible 
because we show that for polynomially many rounds there is always a request at r from one of the H,s to 
serve. Thus, to inform Dr all information must go through one of the edges { r,, Zy }, i = 1,..., m, with r t 
informed. In less than a polynomial number of rounds few enough of the rys are informed and in each round 
only few requests from the leaf nodes request from one of the r,s at all making it unlikely that one of 
them requests from an informed r % . Hence propagation through one of these edges is unlikely and it takes a 
long time for the rumor to spread over the entire graph. 

Random RPULL. We start proving that random RPULL manages to spread the rumor quickly in G. 

(1) If there is an informed node in Dq, by 2.4, w.h.p., all of H^ is informed in O(logn) rounds. Assume 

this has happened. Since each root of a LCT I), has degree 3 in G, it requests the rumor from an 
informed leaf node in L c w.c.p.- since no other node in is still uninformed and therefore able to 
create a conflict, w.h.p., in 0(log n) rounds, all root nodes know the rumor. 

(2) If there is an informed node in I), for some i = 1 ,,m, due to 2.4, the whole LCT D, is informed 
w.h.p.(m) (=w.h.p.) within O(logm) = O(logn) rounds. Assume this has happened. Node c* G C a 
has degree at most 4 and therefore requests from its neighboring node w.c.p., and since all nodes in 
I), are informed, it will also get the rumor. 

(3) If there is an informed node in D a , by 2.4, w.h.p., all of D a is informed in 0(log n) rounds. Assume 
this has happened. Almost half of all neighbors of r lie in L a , and with same reasoning as above, r gets 
the rumor w.h.p. within 0(log n) rounds. 
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(4) Let r be informed. All its neighbors in L a have degree n + 1 and therefore request with probability at 
most 1/n from r, i.e., in expectation no more than 1 node from there requests the rumor from r each 
turn. Each neighboring leaf node in some L, has degree m + 1, i.e., requests the rumor from r with 
probability at most 1 /to. Since r has m log n such neighbors, in expectation no more than log n such 
neighbors request from r. With a Chernoff bound, w.h.p. there are no more than O(logn) requests at 
r. Since has degree 3, it therefore requests w.c.p. and gets the rumor with probability 17(1/log n). 
W.h.p., the rumor is therefore propagated to ry in 0( log 2 n) rounds. 

Altogether, wherever the source node is located, the above reasoning shows that, w.h.p., the rumor is propa¬ 
gated to all nodes within 0( log 2 n) rounds. 

Adversarial RPULL. Let s G V be the source node with the rumor. If s G t), for some i G [to], 
then, without loss of generality, we assume that all nodes in D*, D a and r are already informed, initially. 
Otherwise we inform all nodes in D a and r. For i G [to] we call any I), informed , if it contains at least one 
informed node, otherwise uninformed. 

The adversary has the following simple strategy. If ry and at least one other node requests the rumor 
from r, then r chooses to pass the rumor to any other node than ry. In every other aspect it follows an 
arbitrary strategy. 

For time t we denote with X t the number of informed FCTs I), . and we assume without loss of generality 
that the corresponding FCTs are D i,..., Dx t - Let £ t be the event that in round t no node in Dq has the 
rumor. Conditioning on this event implies that, by the structure of our graph and our model, nodes from 
LCTs Di need to get the rumor from either r or from D a , via connections { c t , /,„/}. 

Let X t be the event that X t < Act + 2 log n. At the event that ry gets the rumor in round t and let Ct be 
the event that a node from Ly gets the rumor from one of the roots r % . 

Claim 2.5, P(A/| £f) >1 — 1 /n > e~ 1 ! n for any t < l/5a 

In each round, r can inform at most one node in a yet uninformed LCT !),. Also, any 
uninformed node Itj connects to its neighbor c t in D a only with probability 1/(1 + 1) < l/l. 

With at most rn such uninformed nodes trying to get the rumor from D a each round, the amount 
of nodes = 1,..., to informed through such an edge is upper bounded by a Binomial 
random variable Bin(td, l/l). Let X' t be the random variable that counts the number of times 
when an uninformed LCT I), gets informed through such an edge to D a but not through a 
connection to r. Then, by Chernoff, for 5 = (3 — + 2^^ > 1, 

P (Xt > t + ct + Sct\£t) < P(A/ > (1 + 5)ct) < exp —<51n(l + <5)^ < n~ l . 

Therefore, w.h.p., X t is smaller than Act + 2 log n for t < l/5c. □ 

Claim 2.6. ~P(At+\\£t Cl Xf) < 1/n < 1 — e ~ 1 ! n for any t < l /5c. 

Every uninformed node fj G L t , where j G [logn] and i G [to], requests from r with 
probability 1/(1 + 1). Choosing c large enough X t implies that at least m/2 of the I),s are 
uninformed. Thus there arc at least (to/ 2) • logn uninformed leaf nodes with a connection 
to r in uninformed FCTs At least 0.4c logn such nodes request from r in expectation. 
Choosing c large enough, a simple Chernoff bound gives us that, w.h.p., at least one of these 
nodes requests from r. Consequently, w.h.p., r does not give the rumor to ry in round t + 1. □ 
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Claim 2.7. P(C t+ ] \£ t n X t ) < 1 — e ^ 2ct+logn ' > for any t < l/5c. 


By our assumption of £ t , at the start of round i +1, no node in I)q has the rumor, so for Ct +1 
to possibly happen, a node from must request from one of the nodes r\,, rx t , which it 
does with probability l/(n + 1) < 1/n. The probability for Ct+i to happen is therefore 


P(C*+i|£i n x t ) < l 


( i\ Xt 

\ _ n/ - 1 ” eXP 



(2 ct + log n] 


□ 


We know that At+i U C t+ \ n £/ n X t C £ t+1 since if under condition £ t neither At+i nor Cf + \ happens, 
then no node in D^ can get informed in round t+l. 

Claim 2.8. P(£ t ) > e~^ 2ct2+tlog ^ for any t < l/he. 


The proof follows by induction. In round t = 0 clearly no node in I)q is informed, so the 
induction base holds. For the following, note that conditioned on £t, events At+i and Ct+i (and 
therefore also their complements) are independent. 


P(£r+i) > P(»4i+i u Ct+i n £t n xf) — P (£t n Xf) P(.4t+i n Ct+i\£t. n Xf) 

> P (St) p (Xt\£ t ) P(A^~i\£t n Xt) P(C^~i\£t n x t ) 

l, 9 , x l l i, , 

— (2 ct 2 + t log n) H-1-1— (2ct + log n) 

n n n n 

> exp (/2c(t + l) 2 + (t + 1) logn)^ . □ 

This means, that after t = \Jn/c rounds with probability at least e :i still not all @(n) nodes in G arc 
informed, concluding the proof of Lemma 2.3. □ 

Theorem 2.9. There is a graph G = (V, E) of size @(n), such that for any So = {s } C V: 

• In expectation, the run-time of adversarial RPULL is in fl(i/rr). 

• W.h.p., the run-time of randomized RPULL is in 0(log 2 n). 

Proof. Let G' and G" be duplicates of the graph G from Lemma 2.3, r' a and r" being the respective dupli¬ 
cates of r a . We set G := G 1 U G" and add the edge {r' Q , r'/}. Without loss of generality let s £ V. 

In the random version, the rumor propagates through all of G' in 0(log 2 n) rounds. Due to its low 
degree, r'/ gets the rumor from r' a within 0(log n) time after G' is informed and again, in 0( log 2 n) rounds 
G" is informed completely. 

In the adversarial version, G" can only learn the rumor from G' through edge {r' fy , r'/}. But once r" a 
knows the rumor, we can apply Lemma 2.3 again to prove that now progress is stalled. □ 

3 Comparison of PULL and RPULL 

In this section we compare the two algorithms PULL and random RPULL on general graphs, i.e., we 
analyze how many rounds of random RPULL are enough to cover the progress of one round of PULL. 
More precisely, we show that w.h.p. the set of nodes informed after 0(4 • log n) rounds of random RPULL 
is a superset of the set of nodes informed in a single PULL round. We manage to do so by coupling both 
algorithms. At the end of the section we head out to prove that this bound is tight. Whenever we talk about 
RPULL in this section we mean random RPULL. 



> exp 
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3.1 Dominance and Couplings 


We begin with two examples of insufficient definitions of domination between two rumor spreading algo¬ 
rithms. 

Showing for two algorithms A and A! that P (it € S'" 4 ) > P (u € S A ) holds for all a G U is not 
enough to obtain a natural dominance definition of A over A', since due to dependencies for a set M with 
\M\ > 1 it might still be true that P (M C S'" 4 ) < P (M C S A '). 

Showing that P (M C S A ) > P (M C S'- 4 ) (*) holds for all M C U is not enough either. Assume the 
following example: Let U = {a, b , c} be the set of uninformed nodes. Assume that under A the probability 
that the set of newly informed nodes equals {a, b , c}, {a}, {b} or {c} is 1/8 + e each and the probability 
that it equals one of the sets {a, b}, {a, c}, {6, c} or 0 is 1/8 — e each. Under A 1 we inform any of those sets 
with probability 1/8. A direct computation for all M C {a, b, c}, e.g., for M = {a}, P({a} C S A ') = 1/2 
and 

P(W c S A ) = P (S A = {a}) + P (S A = {a, b }) + P (S A = {a, c}) + P (S A = {a, b, c}) = 1/2, 

shows that inequality (*) is fulfilled for any M C U, but the probability of the event “at least 2 nodes 
are informed” is by 2e smaller for A than for A 1 . Hence, to cover most possibly arising cases, we use the 
definition of the so called (first order) stochastic dominance. 

Stochastic Dominance and Coupling 

Let ( S , As) be a finite distributed lattice and let X\ and X 2 be random variables with distributions Pi and 
P 2 which take values in S. A function / : S —>■ M is called increasing if A As B implies f(A) < f(B). 

Definition 3.1 (Stochastic Dominance). We say that X \ stochastically dominates X 2 if 

E(f(X 1 ))>E(f(X 2 )) 

holds for every increasing function f : S —)• M, where E(-) denotes the expected value. 

In this paper we will set S = 2 U to be the power set of U, where U C U is the set of uninformed nodes, 
As equals the subset relation on U and X\ and A '2 will be the respective random variables describing which 
nodes get informed in PULL and RPULL. 

Alternative to the definition, one can show that one process stochastically dominates a second process 
by defining a monotone coupling between the processes (cmp. Theorem 3.3). 

Definition 3.2 ((Monotone) Coupling). A coupling of two random processes X \ and X 2 , taking values in 
S with distributions Pi and P 2 , is a joint distribution P of a random process (X\,X 2 ) taking values in 
S x S, such that its margins stochastically equal the distributions of X\ and X 2 respectively, 

P ((Xi, X 2 ) = ( A, 5)) = Pi(Xi = A) MA G S and 
Bes 

J2 P ((^t, X 2 ) = (A, B)) = P 2 (X 2 = B) MB e 5. 
nes 

A coupling is called monotone (written X\ < X 2 ) if additionally the following holds: 

VA, B e S with P ^(Xi, X 2 ) = (A, B)^j > 0 it follows that A A 5 B. 

A coupling is called monotone w.h.p. (written X\ < w _h.p. X 2 ) if for some c > 1 it satisfies 


( 1 ) 



Colloquially speaking, having a monotone coupling between two rumor spreading processes means that 
one process is at least as effective as the other one in every possible aspect. More precisely, condition (1) 
says that if, in the joint distribution, there is a positive probability that process 1 informs exactly the nodes in 
A and process 2 informs the nodes in B, then process 2 will (at least) inform all nodes which arc informed 
by process 1. Condition (2) says that condition (1) holds with high probability. 

The following theorem, Strassen’s Theorem [7, 19], shows an equivalence between stochastic dominance 
and the notion of monotone couplings. 

Theorem 3.3 (Strassen). The following are equivalent: 

1. X\ stochastically dominates A' 2 , 

2. There exists a monotone coupling between X\ and X 2 such that X\ < X 2 , 

3. P (X\ E F) > P (A '2 G F) holds for every monotone set F C S. 3 

We want to show that O(^logn) rounds of random RPULL stochastically dominate one round of 
PULL. This, however, is not possible as one can easily construct a graph in which some node u is informed 
with probability 1 in one round of PULL, but with probability less than 1 in 0( 4 log n) rounds of RPULL. 4 
Hence a monotone coupling does not exist either. We therefore introduce the notion of highly probable 
stochastical dominance in analogy to the equivalencies from Strassen’s Theorem. 

Definition 3.4. X\ stochastically dominates X 2 with high probability, if there exists a coupling between X\ 
and X 2 that is monotone with high probability. 

3.2 W.h.p. Monotone Coupling between PULL and RPULL 

Theorem 3.5. W.h.p., for any set of informed nodes S C V, T = 0(j- log n) rounds of random RPULL 
stochastically dominate a single round of PULL. 

Corollary 3.6. If in a graph G with initially informed nodes S C V the PULL algorithm informs all nodes 
in T rounds with probability p, then the random RPULL algorithm informs all nodes in time 0(T■ f Tog n) 
with probability (1 — o(l))p. 

By PUSH — RPULL we denote the combination of RPULL with a simultaneous execution of the 
classic PUSH protocol. The restriction of a single node to answer only a limited number of requests does 
not limit the progress of the PUSH algorithm when disseminating a rumor. Hence we deduce the following 
corollary. 

Corollary 3.7. W.h.p., for any set of informed nodes S C V, T = O (y log n) rounds of PUSH — RPULL 
stochastically dominate a single round o/PUSH — PULL. 

To reduce dependencies between nodes which request from the same neighbor we introduce a new 
algorithm VPULL (virtual pull), which we let run for T + 1 rounds and which, in any of those rounds, 
is strictly inferior to RPULL - except for some rare cases that, w.h.p., do not arise. Note that VPULL is 
only introduced as a tool to analyze the algorithm RPULL; hence difficulties/impossibilities that arise in an 
actual implementation of VPULL are not relevant. The proof of Theorem 3.5 is then split into two parts: 

1. Lemma 3.9: W.h.p., T rounds of RPULL stoch. dominate (T + 1) rounds of VPULL, 

2. Lemma 3.12: (T + 1) rounds of VPULL stochastically dominate one round of PULL. 

3 A set F C S is called monotone if A £ F and A <s B implies B £ F. 

4 Figure 2 in Appendix A can be easily used to verify this. 
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Then Theorem 3.5 follows from the transitivity of the stochastical dominance relation. 

By RPULLy we denote the (randomized) process RPULL which runs for T rounds, by VPULLy+i 
we denote the process VPULL which runs for T + 1 rounds and by PULLi we denote the process PULL 
which runs for one round only. The random variables ,S’y PLLL , p y LL and ,S' PULL denote the respective sets 
of nodes that arc informed after the corresponding number of rounds. The processes RPULL/, VPULLt+i 
and PULLi arc not completely characterized by the random variables S^ PULL , ,3') : ' pl | ;LL and S' PULL - one 
has to include information about all requests and messages, that arc sent by all nodes, to fully describe the 
random processes. Nevertheless, to show the desired result, it is sufficient to find a monotone coupling 
where condition (1) and (2), respectively, arc fulfilled with regal'd to the subset relation of the set valued 
random variables S' PPULL , S'^ PI I JLL and S' PULL . 

Definition of VPULL 

An execution of VPULL consists of two phases. In the first phase nodes send tokens instead of the actual 
rumor and w.h.p. nodes who have received a token in the first phase are informed at the end of the second 
phase. In an execution of VPULL we let X v (t) be the number of tokens which node v has sent up to round 
t. In a specific round t denote with R v (t) the set of nodes requesting from some informed node v £ S and 
with r v {t) = \R v (t) \ its cardinality. R v , r v and X v are random variables which describe certain properties 
of an execution of VPULL, where large values of X v or r v indicate the unlikely case in which the (strict) 
monotonicity of the coupling might break. 

Let us also define weakly connected nodes u £ U as nodes for which ds(u)/d(u) < 1/2 and strongly 
connected otherwise. Let 

I< = 0 (^ + log n y T' = 0 ^ lognj and T = @(T'), T > T 

Definition 3.8 (Good, Bad Execution). A T-round execution of VPULL is called a bad execution if for 
some v £ V or 1 < t < T it holds that X v (t) > K or r v (t) > I\, other-wise it is called a good execution. 

First, we describe the algorithm informally. An execution of VPULL is split into two phases - the first 
phase consists of T rounds and the second phase of one round. In the first phase an uninformed node requests 
the rumor uniformly at random from one of its neighbors and an informed node v decides with probability 
fj whether to send out a token - in which case it selects, uniformly at random, one of its incoming requests 
as destination for the token. Nodes that get a token in those T rounds, stop requesting from neighbors, but 
are still unable to forward any information to neighbors in consecutive rounds. In round T + 1 the limit 
to the number of requests that can be served by an informed node is stripped away. Then, in case of a bad 
execution all actions from the first T rounds are discarded and all uninformed nodes perform one round of 
PULL. In case of a good execution all uninformed strongly connected nodes perform one round of PULL 
and afterwards all nodes holding a token arc being informed. If we assume that tokens are as valuable as 
the information itself, in each of the first T rounds, VPULL differs from RPULL only in the fact that the 
selected incoming connection is established with probability fj whereas it is established deterministically 
in RPULL. For an uninformed node u G U, that chooses to request a neighbor v G S, this normalizes the 
probability to get a token to 1/T', independent of the amount of other requesting nodes. Except for round 
T + 1 this algorithm is clearly dominated by RPULL. 

A formal definition is given by the following pseudocode where the parameters K and T' arc defined 
as above. Note that the variables X v , R v , r v , BE V and BE can either be understood as random variables 
describing an execution of the VPULL algorithm or they can be updated directly in the algorithm as done 
below. Except for line 16 which uses global knowledge VPULL can be seen as a distributed algorithm. 
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Algorithm 1 One (T + 1)-round execution of VPULL 

Input: K - threshold for bad execution; T' - parameter to normalize probabilities 

States: informed; uninformed 

Oracle knowledge: ds(v) for every node v; BE := \/ veV BE V 

Variables: R v set of nodes requesting from v in the corresp. round (r v := \R, : \) 

BE V boolean indicator for bad execution caused at node v 

tokenReceived,. indicates whether a node will be informed after T rounds 
1: BE V false; tokenReceived v t— false 

2: for T rounds do 
3: switch state v do 

4: case uninformed 

5: if tokenReceivedy = false then 

6: send request for rumor uniformly at random 

7: if msg = token then 

8: tokenReceivedy true 

9: case informed 

10: if r v > K or X v > K then // bad execution has been detected locally 

11: BE V <— true 

12 : else 

13: with probability r v /T' do 

14: send token to uniformly at random chosen node in R v / 0 

15: Xy i Xy 1 

// Round T + 1: 

16: request (BE, ds(v)) from global oracle 

17: if BE = true then //bad execution has been detected globally 

18: execute one round of PULL // i.e., informed nodes inform all requesting neighbors 

19: else 

20: if ds(v) / d(v) >1/2 then // node is strongly connected 

21: execute one round of PULL 

22: else if tokenReceivedy = true then 

23: state v <r- informed //node learns rumor 


W.h.p. Monotone Coupling between RPULL and VPULL 

We generate first a coupling between RPULL and VPULL. In more layman terms imagine a (random) 
binary string a that contains all the information to generate either process in such a way that the informed 
nodes S'^Y LL are a subset of 5'^ PULL for almost all strings a; actually the probability that cr is chosen in a 
way that S^YY LL is not a subset of S’!, 111 " 1 " is less than n~ c . 

The coupling works in the following way. For each round t £ {1,...,T + 1} and each node u we gen¬ 
erate some random values s u (t), s' u (t) in [0,1]. If u is uninformed (in either algorithm) and has not received 
a token at the beginning of round t then s u (t) is used to determine which neighbor a contacts, otherwise (if 
u is informed) s u (t) is used to select to which requesting node (if any) a token or the information, respec¬ 
tively, is handed over. In VPULL, s' u (t) is hereby used to determine whether v does send out any message 
at all, confer line 13 from Algorithm 1. Clearly, a node u that is provided with a token in VPULL in any 
round t < T is then also informed in RPULL. For round T + 1 in VPULL the values s u (T + 1) are used 
to simulate one round of PULL for any node that is required to do so, as stated in the VPULL algorithm. 

We claim that, w.h.p., s u (T + 1) is not used in the execution of VPULL for any node u that does not 
get informed in RPULLy, which thus implies that, w.h.p., 5^YY LL — S^ PULL - 

Lemma 3.9. RPULLj' stochastically dominates VPULL/| ] with high probability. 

Proof. Under the assumption that tokens arc as valuable as the information itself we constructed a monotone 
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coupling of Sy PULL and S' PPULL . Now, it is sufficient to prove that in round T + 1 of VPULL, w.h.p., no 
node is informed, that has not been informed in the T rounds of RPULL: If neither ever any value r v nor 
any X v exceeded I \, then only strongly connected nodes simulate one round of PULL in round T + 1 of 
the VPULL algorithm. We claim that each strongly connected node has been informed in the first T rounds 
of RPULL. 

A strongly connected node u 6 U requests from an informed node v E S with probability at least 
1/2. In any given round due to Markov inequality with probability at least 1/2 no more than 2A/6 nodes 
vl E U connect to v. The probability for u to get informed under RPULL is thus at least . Choosing 
T = 0(f logn) big enough and a union bound gives us that, w.h.p., all strongly connected nodes are 
informed in process RPULLy. 

To conclude, we prove that w.h.p. neither r(v) nor X(v) exceed K for any node v during an execution 
of VPULL. Let 1 < k < ct,t' be constants, K' := y + log n and K = ct,t'K'■ 

W.h.p., r v < K in VPULL T for all v. For a fixed informed node v, in expectation, no more than =/ nodes 
can request from v. Using a Chernoff bound for a single round and a single node, P (r v > j- + k log n) < 
n -©( K ) holds. With a union bound over all nodes and all rounds and n large enough we obtain that, w.h.p., 
r v never exceeds kK' and therefore neither K. A union bound over all nodes concludes the proof. 

W.h.p., X v < K in VPULL T for all v. For a fixed v, note that, w.h.p., in a single round no more than kK' 
nodes request from v, and therefore, X v is increased at most with probability kK' /T' in any round. Over T 
rounds, in expectation, no more than k K' , increments of X(v) happen, and again a Chernoff bound gives 
us that X v does not exceed 2nK' ^ with high probability. Choosing c.tt> = 2kjj and a union bound over 
all nodes concludes the proof. □ 


Stochastic Dominance between VPULL and PULL 

In a single round of PULL a node a E U is informed with probability , independently from which 
other nodes arc informed. For T + 1 rounds of VPULL we can show that a node is informed at least with 
the same probability and independently from which other nodes get informed, as claimed in the next lemma. 
Afterwards, we prove that Lemma 3.10 is sufficient to deduce the stochastic dominance of VPULL / + i over 
PULLi. For u € U and random process X, let C;/ be the set of all conditions of the type v €E X or v ^ X 
where v / u. 

Lemma 3.10. In VPULLj^ | a node u £ U is informed at least with probability , independently 
from which other nodes are informed, i.e., for all sets of conditions I C C7 ULL and J C C PULL with 
P(/),P(J) > 0 the following holds 




> 


ds(u) 

d(u) 


= p 


(u G Sf 


-pull\ 


= p 


(« € 5', pull j) . 


(3) 


Proof. If u G U is strongly connected, the result holds because VPULL executes one round of PULL for 
u in either way. In a bad execution, VPULL executes one round of PULL for any uninformed node and 
the claim holds trivially. Thus assume that u is weakly connected and we arc in a good execution. Let 
s = ds(u) and Ns(u) = {ni,..., be the neighbors of u in S. We call a node v G Ns(u) busy w.r.t. u 
in round t if it informs some node other than u. Let y t be the number of busy nodes in round t w.r.t. u. In a 
good execution (which we denote by G), any node in Ng(u) can inform at most K nodes and hence there is 
the following constraint on the sum of all yfs 


T 

vt< s ■ K - 


t =i 


(4) 


12 



We can ignore conditions in I corresponding to nodes which do not have a common neighbor with Ng(u) U 
{//} because u can only get the rumor directly through S. The only negative effect on the probability that u 
gets informed by the conditions in I can be captured by the number of busy nodes w.r.t. it. However, since 
the number of nodes which are informed per node in a good execution is small compared with T, there are 
sufficiently many rounds with sufficiently many non-busy nodes to inform it. More precisely, if u requests 
from a non-busy node it is informed at least with probability . Thus, the probability that it, conditioned 
on I A G with P(f A G) >0, is not informed is smaller or equal to (with c = T/T') 


T 


n 


s ~ yt \ < 
d(u) ■ T') ~ 


( _ s ( X t) 

l d(u) ■ V 


T 



K\ s 

T ) d(u) < 1 _ 


ds(u) 
d(u ) 


The first inequality holds because under constraint (4) the expression on the left hand side is maximized for 
yt = The last inequality holds due to -jfFj < 1/2, c(l — K/T ) > 2 and the fact that e~ 2x < 1 — x for 
any x £ [ 0 , 1 / 2 ], □ 


The following result is due to Holley [15] and provides a sufficient criterion for stochastic dominance if 
the measures are chosen accordingly, e.g., as in the proof of Lemma 3.12. 

Theorem 3.11 (Holley Inequality, [ 5]). Let ( S , <) be a distributive lattice and let fi i, //2 be measures on 
this lattice. The Holley criterion is satisfied if 


gi( 4 nB)g 2 (^UB) > ni(A) fi 2 (B) holds for all A, B € S. (5) 


If the Holley criterion is satisfied for //1 and //9 then 

E ci wm > 112 (A) f (A) holds for all increasing functions f : S —> M. ( 6 ) 

Aes Aes 

Lemma 3.12. VPULLji+i stochastically dominates PULLi. 

Proof For the proof let U be those uninformed nodes u with 0 < ds(u) < d(u ) and consider the distributive 
lattice ( S , ^ 5 ) = (2 U . C). Every uninformed node u which is not contained in this redefined U has either 
no connection to S at all, i.e., it is not informed in either process, or ds(u) = d(u) holds, i.e., it is informed 
with probability one in either process because also VPULL executes one round of PULL for it. Hence it 
is sufficient to show stochastic domination of iSjW LL over ULL restricted to this redefined set U. This 
choice of U provides 0 < P(S'f ULL = .4). P(,S') ,| [ n l ,LL = .4) < 1 for all 4 G <S and we define the strictly 

positive measures p±(F) := P (^5'^ 1 /Y LL £ F^j and / 12 (F) := P (5f ULL £ F) for F C 2 U . For A in 

S = 2 U , x £ U define A x := A U {x} and A x := H\{x}. The proofs of the follow claim is based on 
Lemma 3.10. 

Claim 3.13 (Quotient Rule). 

^ Ll \ A \ > holds f° r 0,1 A,B eS. (7) 

(A x ) ~ p 2 (B x ) 

At first note that for any C £ S it holds that 

P(X p = C x \X p £ ({ C x } U {C x })) = P(x £ X p ) = d ^±. (8) 

This is true because in PULL x is informed independently of what else is happening and due 
to the fact that we already condition on X p being either C x or C x , hence the probability of 
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X p being C x depends solely on x being informed. Second, recall that // ] and p 2 are strictly 
positive measures, so pi(C), [ 12 (C) > 0 for every C G S, even C = 0, U. 

Let A, B G S and x £ U. Then Lemma 3.10 implies 

> P(X p = A x \X p e ({A x } u {A*})) 

P 2 (B x ) 

~ m(B x ) + p 2 {B x ) 

< H2(B x ) + H2(B x ) 

- P 2 {B x ) 

< P2 (B x ) 

- H2(B x ) 

> V2(B X ) ' 

/A (/>’,•) ’ P 

Claim 3.14. The quotient rule (3.13) implies that the Holley criterion is satisfied for jj\ and p 2 . 

Let A, B 6 S and C := A\B = {ci,..., c r }. The Holley criterion is trivially fulfilled 
if A C B\ hence assume otherwise, which implies r > 1. Write C s := {ci,..., c s } for 
1 < .s' < r. By a telescoping argument we obtain the following. 

Pi(AaB) _ /ji(BuC) Ti{B U C r _i) /xi(BUCi) 

pi(B) w(BUC r _i) pi(Bl)C r -2 ) di{B) 

Applying (7) to each fraction we obtain 

ii 2 ((Ar\B)\jC) fi 2 {(AnB)uc r -i) p 2 ({AnB)uc 1 ) 

~ p 2 {{A nB) uCr-i) ’ p 2 ((A nB)uC r _ 2 ) . fj, 2 ((A n b)) 

= d2{A) 

p 2 (AnB)' □ 

The proof of Lemma 3.12 then follows with Claim 3.14, Theorem 3.11 and the definition of the expected 
value of an increasing function / : S —> M. □ 

Proof of Theorem 3.5. The proof is a direct combination of Lemma 3.9 and Lemma 3.12. □ 

3.3 The Round-by-Round Analysis is Tight 

Lemma 3.15. The time bound T = 0(^ log n) from Theorem 3.5 is tight. 

Proof. In order for T rounds of RPULL to quasi-dominate 1 round of PULL, any node v must get informed 
in T rounds of RPULL with at least the same probability as within one round of PULL (or w.h.p., if the 
latter probability equals one). We construct a graph G for which at least T = log n) rounds of RPULL 
are necessary to guarantee this. A picture depicting the graph can be found in Appendix A. 

We partition set V into V = AldHUTpU ... UT fc 2 fc 2 , where A = {ai,..., cq. 2 }, B = { 61 , ■ ■ ■ ,b k 2 } 
and k = n 1 / 5 . A and B form a complete bipartite graph with edges running between A and B. For each 

5 The proof is adapted from [14, chapter 2, page 24], 


P(X VF = A*|X VP e ({A*} U {Ar})) 

= P(X p = B x \X p G ({B x } U {£*})) 

Pi(A x ) 


pi (A* 
Pi (A* 


+ pi(A x ) 
+ pi(A x ) 


pi(A x ) 

pi ( A x ) 


Pi(A x ) 
Pi(A x ) 
pi ( A x ) 
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j E [. k 2 ], node b, is connected to one node tj j E T, j. Each T, j forms a complete graph of size k. In this 
graph, 5 = k — 1 (acquired in Tj.j) and A = 2k 2 (nodes in B), and therefore A/<5 E 0(/>:). The total size of 
the graph is | Vj = n + o{n). Initially, we let So = B. 

In this graph, within one round of PULL, all nodes of A arc informed with probability 1. Now, consider 
the same graph after m < k 2 /2 rounds of RPULL and let us assume that some node a E A is still 
uninformed. It requests in this round from some node b t . Let X, be the number of requests at b{. Within 
m rounds, each node bi managed to inform at most m of its neighbors from NB, := ..., f ?m }. Since 

m < k 2 / 2, at least half of all nodes in NB, are still uninformed and thus, since they have degree k, EfX.,] > 
k/2. Applying Chernoff, we get that w.h.p., X, > k/A. In this scenario for a the probability to be chosen 
over one of its competitors is at most 4/k, regardless of m, and therefore, P(a E *S'^ PULL ) < 1 — (f . 
Lor this to exceed 1 — 1 /n,m has to be in 0 (4 log n ). □ 

4 Conclusions 

Lemma 3.15 and Theorem 3.5 show that to simulate one round of PULL, 0(^ logn) rounds of RPULL 
are required. However, in case one wants stochastical dominance (w.h.p.) over T > 1 rounds of PULL, 
the lower bound proof of Lemma 3.15 does not hold anymore. We believe that for T = Q(log n), on any 
graph G and any set of initially informed nodes S C V, 0(T(j- + log n)) or maybe even O (T( y) + log n) 
rounds of RPULL suffice to stochastically dominate T rounds of PULL. That proving this assumption 
might be a challenging task is underlined by a similar conjecture in [ 1 ], in which the authors do a coupling 
of synchronous and asynchronous PUSH-PULL. They obtain a similar multiplicative O(logn) factor and 
also conjecture that it can be improved to an additive 0(log n) term. 

A possible alternative restriction of the PUSH-PULL protocol could be given by the following algo¬ 
rithm. In each round, every node requests from an outgoing neighbor chosen uniformly at random. At each 
node, one of the incoming requests is chosen (e.g., uniformly at random) and a connection to the requesting 
node is established. Linally, over all established links between an informed and an uniformed node, the 
uninformed node learns the rumor. Note that unlike in the restricted PUSH-PULL valiant described in our 
paper, here, also two informed nodes or two uninformed nodes could be paired. Such a PUSH-PULL vari¬ 
ant can be analyzed in an analogous way to our analysis of the RPULL protocol and it can be shown that 
()(f log n) rounds of this algorithm stochastically dominate a single round of the regular PUSH-PULL 
protocol. 
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Appendices 


A Pictures 

A.l Picture for Lemma 2.3, Figure 1 



Figure 1: Picture for Lemma 2.3, proving that random and adversarial RPULL have exponentially different 
running times on general graphs. Grey areas indicate fully connected parts of the graph. 

In random RPULL the node ry learns the rumor within 0( log 2 n) rounds and can spread the information 
through the graph in polylogarithmic time. 

In adversarial RPULL the adversary prevents ry from learning the rumor by always disseminating the 
rumor to one of the requesting nodes of in D \,..., D rn in every round. We can show that the number of 
informed DiS grows slowly and hence such requests exist w.h.p. as long as no node in I)q is informed. Also, 
with only few D,s informed, due to their high degrees, leaf nodes in Ly are unlikely to request from a 1), 
containing the rumor, and hence the progress of rumor propagation is stalled. 
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A.2 Picture for Lemma 3.15, Figure 2 




Figure 2: Picture for Lemma 3.15, proving that 12(4 logn) rounds of RPULL arc necessary to simulate 
one round of PULL. 

All not tilled circular nodes in the bipartite graph in the top (B) have the rumor at the start of the 
execution. In one round of PULL all tilled nodes in the bipartite graph ( A ) learn the rumor with probability 
one. In random RPULL logn) rounds arc necessary to inform these nodes, because for each of the 
nodes in A the probability to be informed in one round of RPULL is in ©(1/A;). This is due to the high 
number of requests from nodes in T r J each round to the informed nodes in B. 


B Chernoff Statement for Lemma 2.2 

Lemma B.l. Let X \, X- 2 ,..., X n be independent geometric random variables with X t ~ Geo(p t ) for 
i = 1,..., n and 0 < pi < p 2 < ... < 1. Let X = l A/. Then 

P{X>3(p + t)) <e-^^- pit . (9) 

Proof. Let 7 = — ln(l — ^) > 0. Because of pi < pi this implies e 7 • (1 — pf) < 1 for all i = 1,..., n. 
We need this condition at (*) in the proof of the following claim. 

Claim B.2. E[e 7 ' Yi ] < 1 + ^. 

With a straight forward calculation one obtains 
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Since x h+ e rx is an increasing function for 7 > 0 we obtain 


P (X > 3(/i + t)) = P (e^ x > 
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which proves the actual result. 


(Markov) 
(claim 1) 
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